Segmentation of the Yellow Pages

Authors

  • Stephen Fischer
  • Adnan Amin
  • D. Drivas
Abstract

We present a fully automated process to scan the Australian Telecom Yellow Pages and produce a text document consisting only of the business entries, while removing the advertisements, graphics, and notes about the Yellow Pages. The system contains four major components: digitisation and thresholding, skew detection, segmentation (removal of unwanted parts of the image), and finally the recognition engine utilising the principles of mathematical morphology. This paper presents the current research, which consists of the process described above up to image segmentation. All the algorithms are written in C on a 5000/20 DEC workstation. We have tested more than 30 images with extremely ...

Similar resources

Figuring Out What the User Wants: Steps Toward an Automatic Yellow Pages Assistant

An experimental system, AYPA, for automatic Yellow Pages assistance is described. The system, which operates in the domain of automobiles, automobile parts, and related objects, reads the user's request in simple English, analyzes it and represents it in terms of the system's conceptual primitives. From this, the system tries to figure out the intent of the request and formulate a Yellow Pages ...

Automatic Yellow-Pages pagination and layout

The compact and harmonious layout of ads and text is a fundamental and costly step in the production of commercial telephone directories ("Yellow Pages"). We formulate a canonical version of Yellow-Pages pagination and layout (YPPL) as an optimization problem in which the task is to position ads and text-stream segments on sequential pages so as to minimize total page length and maximize certai...

Web pages segmentation for document selection in Question Answering (Pré-segmentation de pages web et sélection de documents pertinents en Questions-Réponses) [in French]

In this article, we present a method for segmenting web pages into text blocks for the selection of relevant documents in question answering. Documents are segmented before they are indexed, and the resulting segments are further split into passages at answer-extraction time. The textual content of the pages is extracted using an extractor...

Quantitative Comparison of SPM, FSL, and Brainsuite for Brain MR Image Segmentation

Background: Accurate brain tissue segmentation from magnetic resonance (MR) images is an important step in analysis of cerebral images. There are software packages which are used for brain segmentation. These packages usually contain a set of skull stripping, intensity non-uniformity (bias) correction and segmentation routines. Thus, assessment of the quality of the segmented gray matter (GM), ...

Pharewell to Phishing: Secure Direction and Redirection over the Web

The conventional wisdom has always been that users should refrain from entering their sensitive data (such as usernames, passwords, and credit card numbers) into http (or white) pages, but they can enter these data into https (or yellow) pages. Unfortunately, this assumption is not valid as it became clear recently that, through human mistakes or Phishing or Pharming attacks, a displayed yellow ...

Publication date: 1995